AITopics | rl experiment

Collaborating Authors

rl experiment

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.30)
Information Technology > Artificial Intelligence > Machine Learning (0.30)

Add feedback

c32de883c5fe94d33a20a717fad53971-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 16:32:31 GMT

artificial intelligence, machine learning, operator, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

08425b881bcde94a383cd258cea331be-AuthorFeedback.pdf

Neural Information Processing SystemsOct-9-2025, 13:05:41 GMT

algorithm, artificial intelligence, machine learning, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.30)
Information Technology > Artificial Intelligence > Machine Learning (0.30)

Add feedback

3fe78a8acf5fda99de95303940a2420c-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 18:16:54 GMT

artificial intelligence, experiment, pcgrad, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.34)

Add feedback

12ffb0968f2f56e51a59a6beb37b2859-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 03:32:11 GMT

We thank the reviewers for their insights and suggestions. Answers below will be included in expanded discussions in future versions of the paper. In the case of R3's car example, as long as states from 10 steps into the future are sampled This is discussed in L211-L215 in Section 6 "Practical Training of γ -Models". The only Monte Carlo trajectory estimates are in the final column for comparison.

experiment, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.32)

Add feedback

We thank R2 and R3 for their vote of confidence and giving this work at high score of 9 and 8 respectively

Neural Information Processing SystemsAug-22-2025, 02:23:20 GMT

We thank all our reviewers for their feedback! We will respond to (R2, R3) separately to R1 due to different concerns. We thank R2 and R3 for their vote of confidence and giving this work at high score of 9 and 8 respectively. It means a lot to us - to see our ideas accepted by our peers at NeurIPS who also believe that our "work opens many new We experimented with setting all weights to a single fixed value, e.g. However, if we then nudge that value by a small amount, to say 0.6, the network fails completely at the In fact, the best performing values were outside of this training set. We will cite and discuss this work in our revised paper. NeurIPS2019 will discuss similar themes and we are excited to see more ideas in this direction from both communities. We agree with R3 that scaling up is the next step. Stanley2009) to scale W ANNs architectures to scales able to compete on benchmarks such as ImageNET and Atari. We wish to take the time to conduct this investigation thoroughly, and plan to report the findings in a follow up paper. W ANNs. We would also like to thank R3 for the other minor suggestions, we will clarify the labels and information. In the spirit of this extreme experiment the algorithm used was purposefully kept simple. Our original intention was to focus only on continuous-control RL experiments, and decided to run MNIST "for fun" We could have confined the paper to only RL experiments (most RL papers don't run MNIST Finally, we do believe there is a connection to the neuroscience field. "What Artificial Neural Networks can Learn from Animal Brains" (Zador2019) whose central theme is that "The first

contribution, experiment, high score, (8 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.41)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback